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ABSTRACT 

We introduce an information-theoretic measure, the Renyi information, to 
describe the galaxy distribution in space. We discuss properties of the informa- 
tion measure, and demonstrate its relationship with the probability distribution 
function and multifractal descriptions. Using the First Look Survey galaxy sam- 
ples observed by the Infrared Array Camera onboard Spitzer Space Telescope, 
we present measurements of the Renyi information, as well as the counts-in-cells 
distribution and multifractal properties of galaxies in mid-infrared wavelengths. 
Guided by multiplicative cascade simulation based on a binomial model, we verify 
our measurements, and discuss the spatial selection effects on measuring infor- 
mation of the spatial structures. We derive structure scan functions at scales 
where selection effects are small for the Spitzer samples. We discuss the results, 
and the potential of applying the Renyi information to measuring other spatial 
structures. 

Subject headings: large-scale structure of universe - galaxies:clusters:general - 
methods:statistical:simulation 



1. Introduction 



The large-scale spatial distribution of galaxies is an important topic for modern cosmol- 
ogy. The cosmic structure as revealed by the observed galaxy spatial distribution is believed 
to originate from primordial density fluctuations. Gravitation amplifies these fluctuations 
and is the main driver for the formation and evolution of the cosmic structures. In the 
current popular scenario, galaxies form inside the previously collapsed "dark" gravitational 
wells in a process joined and modified by gas dynamics, radiative cooling, and photonization. 
The coalescence of these dark halos brings galaxies together and to merge in a hierarchical 
manner. 

The large-scale distribution of the galaxies can be characterized by various statis- 
tical and topological methods. In particular, the 2-point correlation function has been 
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extensively used. It measures the second moment of the probability distribution, and 
statistically completely describes a Gaussian density field, which is believed to represent 
the primordial density fluctuations. However, the density field smoothed over the ob- 
served galaxy spatial distribution is highly non-Gaussian. The evolved cosmic structure 
as probed by galaxy distribution contains highly dense regions crowded by galaxies de- 
lineating spatial voids where few galaxies are located. We generally need the probabil- 
ity distribution function, or its moments, to completely characterize such galaxy distribu- 
tion in space. A counts-in-cells method has been used to establish the galaxy probability 
distribution function. Theory (Saslaw & Hamilton 1984; Saslaw & Fang 1996) and models 
(Carruthers & Minh Duong- Van 1983; Fry 1984; Fry 1985) have been developed to interpret 
the probability distribution. 

A multifractal description for galaxy spatial distribution has been studied both theoreti- 
cally (Pietronero 1987; Jones et al. 1988; Borgani 1993) and numerically (Valdarnini et al. 1992), 
and applied to several galaxy samples (Martinez et al. 1990; Borgani et al. 1993). In par- 
ticular, Borgani (1993) studied the multifractal behavior of various hierarchical probability 
distribution functions and derived the behavior of multifractal dimensions for extreme un- 
derdense and overdense regions. Indeed, the geometrical concepts of fractal and multifractal 
are appealing given the ubiquitous presence of such structures in various natural and so- 
cial phenomena (Mandelbrot 1983). Less- well-perceived has been the statistical origin of 
multifractals as characterizing the moments of a probability distribution. For a review of 
multifractal applications in large-scale structure, see Coleman & Pietronero (1992) and Bor- 
gani (1995). 

The purpose of this paper is to introduce Renyi information as a valid characterization of 
any spatial structure, including galaxy distribution. We show that Renyi information, being 
closely related to probability distribution and multifractal measures, probes the statistical 
moments sensitive to any levels of under- and overdense spatial structures. At scales where 
the information contents are well-preserved and can be accurately quantified, statistical 
moments are jointly described by Renyi information and dimensions, for which the underlying 
generator has a physical origin. We also illustrate the procedure by applying the Renyi 
information, along with the probability distribution and multifractal measures, to observed 
galaxy samples in the infrared wavelengths as well as a simulation. 

In the next section, we introduce Renyi information and the properties, the relations to 
the moments of the probability distribution function and to the multifractal measurements. 
In Section 3 we present the results of the probability distribution and Renyi information for 
the infrared samples observed by the Infrared Array Camera (IRAC) onboard Spitzer Space 
Telescope. We discuss a multiplicative cascade simulation in Section 4, which provides means 
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of validating our methods of measurement, and showing the effects of spatial selections in our 
galaxy samples. We further derive the functions scanning the structure of the moments for 
the samples based on simulation results. We discuss our results and potential applications 
of the information measure in Section 5. 



2. Renyi Information, Renyi Dimensions, and Structure Scan Functions 

Shannon & Weaver (1948) derived an information measure to describe the amount of 
information needed in order to know the occurrence of an event with a given probability. 
In an important development, Renyi (1970) expanded Shannon's information measure to 
arbitrary orders. Suppose we have N c cells placed to cover a distribution of N g galaxies. This 
can either be a 2-dimensional angular or 3-dimensional spatial distribution. The probability 
Pk of finding a galaxy in a given cell k containing galaxies is pk = Nk/N g . The Renyi 
information is defined as 



where (3 is the information order which in principle can be any real number (although in our 
application we consider integers only). At positive orders the overdense structures dominate 
the information estimate, whereas the underdense structures contribute the most to the 
information measure at negative orders. At (5 — 1, the Renyi information reduces to the 
Shannon information. 

The summation term for the probabilities p& to order j3 can also be written as 



where /(iVj) is the galaxy probability distribution function. Therefore the Renyi information 
of order (3 is related to the /5-moment of the probability distribution as 
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which is in turn related to the volume-averaged /3-point correlation function (Peebles 1980). 
The relation is intuitively easy to understand as is simply the total probability of finding 
(3 galaxies in a cell. At positive orders of integral f3 the Renyi information characterizes the 
amount of information corresponding to the event of finding f3 galaxies in the cells covering 
the discrete galaxy distribution. 

Some properties of the Renyi information indicate the behavior of the moments of the 
galaxy spatial distribution. It can be proved (Beck 1990) that 

(J>f)V(/^) < (£pf )V(/*-D, (5) 
($>f)^ > (J>f A and (31(32 > 0. (6) 

Taking the logarithm we get 



Ipi < I?2, (31 < (32, (7) 
^J-^i > ^f^ 1 ^ ( 3l< ( 32 and W2>0. (8) 

Since < < 1 we have J^pf < J^p^ = 1 for /3 > 1, and J^p^ > = 1 for < 1. 
Therefore there is an upper limit Ip < for all /3. We need zero information, or have perfect 
knowledge for an event when Ip = 0. The bounds are also reflected by 

Ipi< fff!/^ 1 ^ 1 f ° r 1 < (31 < (32 and /31< (32 < 0, (9) 

^-^brw 1 ^ for 0</91</32<L (10) 

The Renyi information depend on the cell size /, and diverge as / — > 0. One property 
that remains finite at this limit is the so-called Renyi dimensions: 

D((3) = hm (11) 
i— >o log / 

Any galaxy distribution becomes discontinuous at the scale of the typical galaxy separa- 
tion. The above limit is not achieved in a discrete distribution or in practical measurements. 
A more practical definition for galaxy distribution is the "effective" Renyi dimensions, for 
which we calculate the slope of Ip versus log/. There is no reason a priori to expect the 
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slope for a given order to be a constant over all scales for a given structure. In fact, this 
is not implied in equation 11 for a continuous multifractal distribution. We call it a simple 
multifractal if the effective Renyi dimension for any given order has a single slope across all 
scales. 

Examining Renyi information and dimensions over information orders is identical to in- 
specting the structure of statistical moments of a distribution. Studying such scan functions 
has the advantage of summarizing infinite amount of parameters (moments) in just a few 
relations for a statistical distribution. Here we relate Renyi dimensions to a scan function 
defined in a continuous multifractal field. Suppose (ej) is the field density measured and 
ensemble-averaged at scale I. Function K(/3) is the scaling exponent for moments of the 
field (also called the structure function) (ef) oc l~ K ^\ Now that the Renyi dimension is 
practically D{(5) = dlp/dlogl, since (ef) oc (iV 3 )/" 13 ' 3 oc Y.p1 1 ~ DP+L \ where N is the counts 
in the cells of size /, D is the dimension of the space in which the distribution is embedded 
(e.g. D = 2 in our applications below), we obtain 

K{(3) = {(3-l){D-D{(3)). (12) 

Function K(j3) is therefore also called the codimension (Schertzer & Lovejoy 1987). 
Here we use a general name, the structure scan function, for the Renyi information and 
dimensions as functions of f3, as well as for functions like K{j3). 

Th multifractal dimensions are usually defined by using the generalized correlation in- 
tegral (Hentschel & Procaccia 1983; Grassberger & Procaccia 1983). Many measurements 
of the multifractal properties of galaxy distribution have been based on measuring the gen- 
eralized correlation integral, which uses cells of varying sizes centered at selected galaxies. 
Such a procedure is not valid for estimating Renyi information since neighboring cells bound 
to cross each other above a certain scale. Below this scale there is a non-zero probability 
that some of the galaxies are not covered by the ensemble of cells. Either case changes the 
normalization for the probabilities, and the Renyi information is not accurately quantified 
for the original structure. This is further explained in Section 4. We want to emphasize 
here that not only the slope of the Renyi information versus scale (Renyi dimensions), but 
also the Renyi information itself is a physical measurement, both being open to physical 
interpretations. We will further discuss this point in Section 5. 

We note that the differential form of the second-order correlation integral is called the 
conditional density, which had been used to characterize galaxy distribution in early surveys 
(Coleman et al. 1988; Lemson & Sanders 1991). 

The Renyi dimensions basically show the scaling properties of Renyi information. Loosely 
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speaking, a multifractal galaxy distribution has a position-dependent scaling exponent a(x) 
in pk ~ l a . It can be shown strictly (Schuster 1995) that the spectra of these scaling expo- 
nents f(a) and the Renyi dimensions (multiplied by a factor of {(3 — 1)) are related by a 
Legendre transformation 

f(a) = -T(P)+/3a. (13) 

where r((3) = ((3 — \)D{0), dr/d(3 = a, and df /da = (3. A number of interesting properties 
of D([3) and f(a) are discussed in Beck (1990). A few interesting ones include D(f3) being 
a decreasing function of (3 and bounded as (3 — > ±oo, D(+oo) = a m j„, and D(— oo) = a maa ;. 
These limits and the ways Z?(/3) and /(a) approach the limits show the properties of the 
moments of the spatial distribution from a scan function perspective. 



3. Measurements and Results 

The IRAC instrument onboard the Spitzer Space Telescope provides fresh view into the 
cosmos in the mid-infrared wavelengths of 3.6 /im, 4.5 /im, 5.8 /im, and 8.0 /im. The Spitzer 
First Look Survey (FLS) using IRAC provides a uniform coverage of a 4 square-degree field 
centered at RA = 17 h 18 m , Dec = 59°30' with a total 60-second exposure time for each pixel 
in the 256 x 256 arrays (Lacy et al. 2006). For our present purpose, we use the full galaxy 
samples established for an earlier 2-point correlation analysis (Fang et al. 2004) across the 
IRAC wavelengths. 

We divide the two-dimensional area covered by an IRAC sample into square cells of 
varying sizes. The cells are non-overlapping and contiguous for the purpose of accurately 
estimating the Renyi information. The boundary of the sample area and the usage of a cell 
are determined by the mask files used to establish the galaxy sample (Fang et al. 2004). We 
always have > 500 "good" cells at the largest scales of measurement to ensure good statistics. 
At smaller scales the cell numbers are much greater. 

For each sample and each cell size we count the number of galaxies in the cells and es- 
tablish the histograms in Figure 1. The histograms represent the estimates of the probability 
distribution function, from which the moments of the distribution can also be measured. For 
each histogram, we plot the fit of the theoretical Gravitational Quasi-equilibrium Distribu- 
tion Function (GQED) (Saslaw & Hamilton 1984; Saslaw k Fang 1996). The single fitting 
parameter b, the average ratio of the gravitational correlation potential energy to twice the 
kinetic energy, is also shown in the plots. For comparison, we also draw the Poisson dis- 
tributions with the same mean galaxy counts in cells of given sizes. Apparently the galaxy 
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distribution deviates more from the Poisson distribution at larger cell sizes, indicating the 
effect of galaxy clustering in IRAC wavelengths. The GQED, on the other hand, describes 
the distributions of IRAC galaxies remarkably well over all scales. 

We follow the same procedures as in the counts-in-cells experiment to divide the IRAC 
sample areas into square cells to calculate the Renyi information. Figure 2 shows the relation 
between the cell sizes and the measured Renyi information at orders from 1 to 20. The Renyi 
information scales with cell sizes, but the relation is not linear for our galaxy samples. We 
will discuss below the effects that can potentially change the scaling relation. The apparent 
crowding of the curves at high information orders implies an upper limit, which is based 
on the above discussion, for the information measures. The limit constrains the behavior of 
the moments of galaxy distribution in the information space. 

Intuitively, exclusion of structures, such as galaxies not covered by the cells, or regions 
that cells avoid due to masking, changes the information content of the structure. Although 
we intend to cover the sample using contiguous cells, changing cell-size causes some galaxies 
in the sample not being covered by cells of a new size due to the sample boundary and 
masked areas. Until these effects can be fully accounted for, we are in fact measuring 
the information of slightly different structures at each scale, even though the probability is 
normalized by the total number of galaxies covered by cells. This introduces noises in the 
information measurements. In the next section, we study these effects using simulation of a 
known structure. 



4. Simulation and Further Results 

To verify our results, we generate a multiplicative cascade simulation based on a binom- 
ical model. The binomial model was found to describe well the multifractal scaling in the 
dissipation field of fully developed turbulence (Meneveau & Sreenivasan 1987). We use the 
binomial model for its simplicity and analytically derivable relations for Renyi information 
and multifractal properties. The multiplicative cascade method was formulated to study 
energy transfers at different scales in turbulence (Mandelbrot 1974; Frisch 1995). It is by far 
the most effective method to simulate a multifractal field. 

We use a discrete multiplicative cascade simulation, consistent with our purpose to study 
counts at multiple length scales. The simulation aims to create distributions of counts at ten 
different scales within a given area with a conserved overall number density -Do- At the first 
level, the area is divided into four quadrants of the same size, two of which hold a fraction 
p/2 of D , and the other two have the fraction (1 — p)/2 (so the overall number density 
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is conserved as D ; also called the canonical process). There are many ways to distribute 
the two number densities equally among four cells. We choose to have a fixed pattern of 
distribution here. We tested different patterns and the results are the same. 

At the second level, each quadrant area is further divided into four identical (smaller) 
cells, with the distribution of the same probabilities of the same pattern. The number counts 
in a cell is the product of the probability assigned at this level multiplied by the probability 
(of the quadrant covering the cell) at the previous level (and by an arbitrary total source 
number, which we choose to use 1). 

We continue the process to generate smaller and smaller cells and their number counts. 
We stop at the tenth level where we have data over 10 scales (of ratio 2) for statistics. 
At level n, a cell has the number count proportional to (|) fc (^ £ ) n_fc , where k is an integer 
between and n. Therefore it is called the binomial model. The resulting structure, although 
modulated sharply by cell edges, is a simple multifractal. Based on Halsey et al. (1986) and 
Meneveau & Sreenivasan (1987), we derive the Renyi information, the Renyi dimensions, 
and the spectra of the multifractal scaling exponent for the 2-dimensional binomial field as 



D =l + 



1-P 



(14) 
(15) 



p / \ i , 1 ,log(l -p) -log(p) log(p) + a \og(p) + a 

f(a) = l + log(— — — +i — n \ — ] — TT lo S ] — n \)> 16 

log(l — p) + a log(l — p) — log(p) — a — log(l — p) 

where n is the level number in the cascade (e.g. smallest scale being n — 10), and (3 is the 
information order. 

Figure 3 shows the scaling of Renyi information in the binomial field. The measurements, 
using the same methods and algorithm used for the infrared samples, are indicated by points 
in the Figure. The lines are calculated based on equation 14. The agreement is nearly perfect 
for all orders. 

Next we compare measurements of the Renyi dimensions and multifractal spectra with 
those predicted by equations 15 and 16. Since the scaling of the measured Renyi information 
in Figure 3 is well-represented by lines, we use linear least-square fit for each information 
order in that figure to obtain the Renyi dimensions. We confirm from the fit that the 
information values at the smallest scale (where logr = 0) and the slope are both within 
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10~ 3 % of those predicted by equations 14 and 15. The slope values from the fit for each 
order are plotted as dots in the left-panel of Figure 4. The line is the scan function for Renyi 
dimensions based on equation 15. To obtain the multifractal spectra, we use a cubic spline 
fit to model the measured Renyi dimensions and derive the a and f(a) values within the 
range of information orders. These values are again represented by the dots in the right- 
panel of Figure 4. The line in the Figure is based on equation 16. In both panels the values 
based on the measured R'enyi information agree well with the predicted ones. This shows 
the reliability of the methods and algorithms we use. 

In Figure 5 we show measurements of generalized correlation integral superimposed on 
the plane of Renyi information versus scale for our simulation. We generate 1000 cell posi- 
tions in the 1024 x 1024 field, and vary the size of the cells centered around these positions. 
Cells may overlap, but are ignored if they cross the field boundary. The generalized cor- 
relation integral is calculated using the remaining cells, following the standard algorithm 
(Martinez et al. 1990). In the Figure the dash-lines are generalized correlation integral mea- 
surements, and solid-lines are the predicted Renyi information. It is clear that although 
the Renyi dimensions may be approximately maintained, the generalized correlation integral 
does not measure Renyi information. We also find that the values of the generalized corre- 
lation integral depends on the number of cells in the experiment, further strengthening this 
point. 

To investigate the effects of spatial selection of structures, we include the mask files 
on which the IRAC samples are based. Each of the mask files is a FITS representation of 
the FLS field with a dimension of 6200 x 6600 pixels. Since our simulation has a different 
dimension, we first project these masks onto a 1024 x 1024-pixel field. This procedure 
maintains the scale ratio of the masked areas and the field size. We then follow the same 
criteria to exclude cells in the simulation field overlapping with projected masked areas, and 
repeat the procedures of measuring the Renyi information in the simulated field. 

The results, using the mask files for the four IRAC samples, are shown in Figure 6, with 
solid-line predictions superimposed on measured points connected by dotted lines. There is 
an obvious effect of spatial selection on measuring both Renyi information and dimensions. 
Notably involving the IRAC masks introduces an apparent scale-dependency of Renyi di- 
mensions, particularly at greater scales, where both the Renyi information and dimensions 
are higher than predicted. At smaller scales, there is a systematic offset to higher (negative) 
information values, although the slopes for Renyi dimensions are approximately maintained. 
Masking reduces the amount of structures in the original binomial field, and smaller amount 
of information (identical to the absolute value of the measured Renyi information) is needed 
to know an event occuring (such as f3 sources in a cell) with a given probability. Cells of 
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increasing scales may cover a fluctuating, but generally increasingly smaller samples from 
the original structure accompanied by a mask. This confirms our intuition that Renyi in- 
formation is an intrinsic property of a spatial structure. Any modifications of the structure 
modify its information content. Other derived properties such as the Renyi dimensions can 
also be affected if not measured properly. While the geometry and pattern of the four mask 
files vary, the effects on the Renyi information are remarkably similar. 

Although it changes the information contents of the original structure, it appears that 
the IRAC masks preserve the scaling of the information at smaller scales. The graininess of 
galaxy distribution, however, introduces a Poisson limit for these smaller scales (remember 
our simulation is not grainy), below which a cell contains either one or no galaxy in most 
of the regions. Any multifractal behavior breaks down at this limit. At scales smaller, the 
number of cells contributing to the Renyi information is roughly identical to the total number 
of galaxies, and the Renyi information reaches a (lower) limit also and flattens out (also see 
Figure 8 below). Both these effects at large and small scales can make the Renyi information 
curves of a multifractal become concave. This curvature is observed in Figure 2. 

Based on the IRAC sample sizes and the un-masked areas for the samples, we estimate 
the mean separations of any two galaxies in the samples, which are roughly 20" for IRAC-1 
and 2, 55" for IRAC-3 and 4 samples, assuming uniform distributions. We use these as the 
lower scale limit for reliable multifractal estimate. For upper limit, Figure 6 implies a linear 
scale of ~ 1% of the field size, assuming that the scale ratio applys to the FLS field. This 
is only slightly higher than the lower limit of IRAC channel-3 and 4 samples. Basically, the 
smaller number of galaxies in these samples combined with the amount of masking prevented 
us from reliably estimating the multifractal behavior for these two samples. 

For illustration purpose, we perform a cubic spline fit to each of the Renyi information 
relations in Figure 2, and derive the Renyi dimensions and the scan function for IRAC-1 and 
2 samples at a scale of 45" . We perform another cubic spline fit to the scan functions (like we 
did for the binomial field) and obtain a and f(a) throughout the range of Renyi dimensions. 
In Figure 7 we show these relations for the two FLS samples. The figure illustrates how 
Renyi dimensions decrease with increasing information order, and converge to a limit. Also 
f(a) appears to be a convex function of a. Where f{a) = 0, the a values represent the 
Renyi dimension limit when f3 — > oo. All are typical behaviors of multifractals. 

In Figure 8, we plot the Renyi information as a function of order, a different type of 
scan function, measured for orders —20 to 20 at scales of 20" , 32" , 44" , 55" , and 68" for all 
IRAC samples. At most of these scales the masking effect is small, where the information 
can be measured accurately for galaxy distribution. For samples at IRAC channels 3 and 4, 
however, Poisson effects dominate the three smaller scales. This is told by the scan curves 
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converging to the Poisson limit below or near information order zero. For all IRAC samples, 
the limit is shown by the scan curve behavior at negative information orders, where the 
information measure is sensitive to and dominated by underdense regions in the samples. At 
positive orders and at scales where information can be measured accurately, the scan curves 
tell the structures of high moments of the galaxy distribution. 

5. Discussion 

We have shown that the Renyi information, the effective Renyi dimensions, their struc- 
ture scan functions, and the multifractal spectra contain the properties of the high moments 
of a spatial distribution. These measurements can be used to scan properties of these high 
moments. These properties detect the amount of deviation from Gaussian densities, and are 
highly constrained in the parameter space in these measurements. 

Our experiments also show that spatial selection effects are important and can bias these 
measurements. Any selection modifies the original structure and the amount of Renyi infor- 
mation the structure contains. Depending on the amount of selection, the Renyi dimensions 
may be maintained over a limited range of scales above the Poisson limit for discrete distri- 
butions. One needs to conduct controlled experiments such as simulations to verify at these 
scales. For IRAC-1 and 2 samples, there is indication in Figure 2 that the information-scale 
relation is still not linear within the range. It is yet uncertain how much of this is caused by 
masking as well as by approaching the Poisson limit, both effects leading systematically to a 
concave curve, or if there is scale-dependency for the Renyi dimensions in our IRAC samples, 
which would imply a more complex structure than a simple multifractal distribution at these 
scales. 

Whether galaxy spatial distribution is a multifractal, or whether homogeneity can be 
reached at large scales, as cosmological principle states, has been observationally a contro- 
versial issue (Peebles 1993; Coleman & Pietronero 1992; Avnir 1998; Martinez 1999). Our 
analyses show that caution needs to be exercised extrapolating a multifractal structure to 
small and large scales, particularly if spatial selection exists for a galaxy sample, even if 
multifractality is observed at scales more reliable for multifractal measurements. 

It may be possible to recover the lost information in a galaxy sample by "filling-in" the 
masks based on known properties of galaxy distribution. Such known properties may come 
from minimally-masked samples of galaxies of the same type, or from iV-body simulations, 
for examples. Just as the (^-function for generating the probability in our multiplicative 
cascade simulation, there is a variety of statistical functions that can serve as the generating 
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functions for simulating full-scale multifractal fields (Gupta & Waymire 1993). Among these 
generating functions the log-Levy distribution is of particular interest due to the unique 
position of the Levy distribution in replacing a Gaussian in the generalized central-limit 
theorem where variances of the component distributions can be infinite, and also due to its 
applications to a "universal class" of geophysical structures (Schertzer & Lovejoy 1987). The 
structure scan functions are uniquely determined by probability generating functions which 
are of physical origin. The generating function would be a significant property to know if 
galaxy distribution is a multifractal to large scales. 

Another way to seek physically interpreting the Renyi information is to use the moments 
of the probability distribution via equations 3 and 4, which are not restricted to a multifractal 
structure. Since N c oc l~ D , where D is the dimension of the space the distribution is 
embedded, we can also derive for equation 3 



D P--—l + (3-l dlogl ' (17) 

where mp is the /3-moment of the probability distribution function. It is clear from the 
relation that we have a simple multifractal distribution across all scales only if d log mp/d log/ 
is not a function of scale. This is not the case for GQED, for example. On the other 
hand, any physically-derived probability distribution can interpret the Renyi information 
and dimensions via these relations. 

Independent of the multifractality of a structure, the Renyi information and dimensions 
are general characterizations of statistical properties of the structure. A simple multifractal 
is a special and very restrictive type of structure in its practical definition. The Renyi 
information and dimensions and their corresponding scan functions can describe any types 
of structures, whether or not multifractals. The Renyi information is extensive, whereas 
its scaling, or "information rate" with changing scales is an intensive parameter. Both are 
important for a given structure. As we collect galaxy samples from surveys with greater 
area coverage and increasing depth as well as in more wavelength channels, we are collecting 
increasingly more information about the large-scale structure, and the absolute values of 
the measured Renyi information increase at a given scale. Any variations of the Renyi 
dimensions, on the other hand, are of different origin. 

For spatially confined structures, such as a giant molecular cloud, the extensivity of 
Renyi information also depends on resolution. A more resolved observation reveals more 
detailed structure, and therefore more information contents. While the Renyi information 
and dimensions can be identically applied to continuous and discrete spatial fields, it is 
important to recognize what properties are used for measurement. It is clear that we want 



-13- 



to characterize the moments of a spatial structure, and that we can use spatial densities for 
measuring. An astronomical observation is usually a radiation measurement, however, and 
the proportionality between the two is only an assumption. For non-astronomical structures, 
the meaning of the measurements can be more clear-cut. 

The Renyi information and dimensions can also be applied to one-dimensional time- 
series. In the temporal domain, the amount of time-delay serves as scaling, and the informa- 
tion contents and rate describe the temporal structure built by distributions of the change 
of the observed properties over certain and different time-spans. An information measure is 
a measure about the knowledge of a structure or system, and therefore its predictability. It 
would be desirable to quantify the predictability of a statistical distribution or a time-series 
using Renyi information and dimensions. So far research on this topic remains limited. 

The relation between the Renyi information and dimensions measured in 2-dimension 
and those in 3 dimensional space for the same structure can be straightforward. The 2- 
dimensional cells used to cover a structure can also be 3-dimensional cells with the third 
dimension extended to cover the same structure. When properties such as spatial density 
can be accounted for by measurements when projecting the structure onto a 2-dimensional 
area, the information is not lost. The only uncertainty is the correspondence between the 
2-dimensional and 3-dimensional scales. It is, however, a generally interesting question what 
scales are measured by cells of non-identical dimensions. For galaxy spatial distribution, 
the evolutionary effects of galaxies in the third dimension need to be disentangled from 
projection before the structure can be analyzed in three dimensions. 

I thank the anonymous referee for providing constructive comments. I thank the FLS 
team at the Spitzer Science Center for assembly of the FLS data products. The Spitzer 
Space Telescope is operated by the Jet Propulsion Laboratory (JPL), California Institute 
of Technology under NASA contract 1407. Support for this work was provided by NASA 
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Fig. 1. — The 2-dimensional counts- in-cells distribution of the IRAC galaxies. Square Cells 
of three different sizes on the side, indicated in the Figure, are used for each IRAC channel 
data. Histograms show the counts of the IRAC sources. The solid lines are the fit using the 
Gravitational Quasi-equilibrium Distribution Function; fitting parameter b is shown in the 
Figure for each case. The dotted lines are the Poisson distributions with the same average 
number of galaxies N avg in given size cells as in the corresponding histograms. 

Fig. 2. — Relation between Renyi information and the cell sizes used to measure the in- 
formation. Each curve represents an information order. For each order we measure the 
information at different cell sizes ranging from 20 to 250 arcsec for IRAC-1 and 2 samples, 
and from 30 to 250 arcsec for IRAC-3 and 4 samples. The measurements are connected by 
lines. The orders are from 1 to 20 from bottom to top. The Renyi information are in "bits" 
with log 2 as the unit. 

Fig. 3. — Renyi information for orders —20 to 20 (bottom to top) as a function of scale in 
the simulated binomial field with p = 0.2. The crosses are measurements using the same 
algorithm as for Figure 2. The lines are predictions of the binomial model with p = 0.2. The 
logarithmic base is 2. 

Fig. 4. — Renyi dimensions as a function of information order and the spectra of scaling 
exponents in the simulated binomial field. The dots are measurements, and the lines are 
predictions based on the binomial model. 

Fig. 5. — Generalized correlation integral measurements in the same binomial-field simula- 
tion. A total of 1000 cell positions are distributed in the 1024 x 1024 grid area. The cells 
are centered at these positions with their sizes vary for each measurement. The values of 
the generalized correlation integral are measured at nine scales for information orders -6, -3, 
2, 3, and 7, shown by the dashed lines. Solid lines are the predicted Renyi information at 
these scales. Here the logarithmic base is 2. 

Fig. 6. — Renyi information for orders —20 to 20 (bottom to top) as a function of scale 
in the simulated binomial field modified by FLS IRAC masks scaled to the size of the 
simulation field. The crosses are measurements, connected by dotted lines. The solid lines 
are predictions as in Figure 3. The logarithmic base is 2 for the figures. 

Fig. 7. — Renyi dimensions as a function of information order and the spectra of scaling 
exponents, estimated at a scale of 45" for IRAC channel-1 and 2 galaxies. A cubic spline 
fit is performed for each curve in Figure 2 for estimating the Renyi dimensions at different 
information orders. This generates the left-panel of the figure for the scan function. A second 
cubic spline fit is used to estimate the scaling exponents and their spectra from the scan 
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function, which produces the right-panel figure. 

Fig. 8. — Structure scan functions of the Renyi information. From top to bottom the lines 
represent the Renyi information measured at scales of 68" , 55" , 44" , 32" , and 20" for orders 
from —20 to 20. At nagative information orders where underdense structures dominate, the 
graininess of galaxy distribution leads the Renyi information to the Poisson limit. 
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